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ABSTRACT  Based  on  visual  perception  principles,  this 
article  studies  computational  models  and  algorithms  associated 
with  the  extraction  and  tracking  of  moving  targets  in  complex 
images.  It  analyzes  and  tests  the  effects  of  changing  relevant 
model  parameters  on  image  segmentation  thresholds.  What  1S 
different  from  certain  conventional  algorithms  is  that  new 
methods  make  an  integrated  consideration  of  object-background 
conditions,  visual  nonlinearities,  interframe  correlativity  and 
differences  into  a  two  step  integrated  target  extraction  process 
including  three  criteria  and  a  fast  optimizing  procedure.  Object 
tracking  makes  use  of  binary  template  matching.  Experimental 
results  are  given  for  visible  light  image  sequences. 
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GRAPHICS  DISCLAIMER 


All  figures,  graphics,  tables,  equations,  etc.  merged  into  this 
translation  were  extracted  from  the  best  quality  copy  available. 


I .  INTRODUCTION 

Real  time  moving  object  extraction  is  an  important  problem 
in  such  fields  as  pattern  recognition,  image  processing,  machine 
vision,  guidance,  and  so  on.  In  normal  tracking  systems,  there 
are  two  basic  operations,  that  is,  object  image  segmentation  and 
correlation  matching.  When  operating  in  complex  image  sequences 
and  altered  environmental  conditions,  the  steadiness  of  these 
operations  and  the  complexity  of  computations  directly  influence 
system  performance.  However,  most  previous  research  work  was 
limited  to  relatively  simple  image  sequences,  thus  limiting  the 
range  of  applications  associated  with  the  research  results. 

Object  image  segmentation  methods  which  are  normally  used  are 
based  on  image  gray  level  information  and  include  Bayesian 
decision  making  methods,  White  Sands  target  range  segmentation 
methods,  optimum  entropy  segmentation  methods,  as  well  as 
clustering  criterion  methods  presented  by  Otsu,  and  so  on  [1-5]. 
These  methods  each  have  their  advantages  and  applicable 
conditions.  Existing  problems  are:  (i)  lack  of  capability  to 

automatically  set  up  appropriate  segmentation  windows  and  /47 
areas;  (ii)  based  on  absolute  gray  or  absolute  gray  variations, 
it  is,  therefore,  very  difficult,  under  conditions  associated 
with  complex  backgrounds  and  environmental  changes,  to  achieve 
object  extraction  results  with  steady  performance.  Attention  is 
paid  to  the  object  extraction  of  human  vision  under  complex 
background  environment  conditions,  tracking  flexibility,  and 
adaptive  performance.  This  article  takes  certain  principles 
among  these  and  introduces  them  into  the  study  of  these  problems 
in  order  supply  the  methodological  foundation  to  develop— m 
complex  pictures— adaptive  object  extraction  and  tracking 

systems . 
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II.  ADAPTIVE  OBJECT  EXTRACTION 
1.  Object  Background  Conditions 

Conventional  object  image  segmentation  methods  assume  that 
areas  awaiting  segmentation  (whole  images  or  sub  images)  contain 
adequate  object  and  background  information.  However,  in  a  number 
of  actual  applications,  these  assumptions  cannot  be  guaranteed 
before  the  fact.  This  creates  one  of  the  causes  for  unstable 
object  extraction  algorithm  performance.  Taking  segmentation 
criteria  put  forward  by  Ostu  as  an  example  [4],  one  obtains 
criteria  associated  with  optimum  segmentation  threshold  to*  as 

(1) 

0  )=max<ffKi)}  (2) 


in  the  equations,  threshold  t  takes  image  elements  f(x,y)  and  ^ 
divides  them  into  object  class  Cl  and  background  class  C2 .  ■<* 

is  the  variance  between  classes,  ©j  is  the  relative  area 
associated  with  the  class  C j .  mj  is  the  average  degree  of  gray 
associated  with  the  class  C j .  As  far  as  j=(l,2)  is  concerned, 
they  are  both  functions  of  threshold  t.  max  is  selecting  maximum 

value  operations » 

Kittler  and  others  discovered  that,  in  images,  when  the 
differences  of  ratios  between  objects  and  backgrounds  are  very 
large,  2  (t)  can  possibly  not  be  a  single  peak.  Even  if  it 

is  a  single  2  (max)  peak,  it  still  does  not  definitely 
correspond  to  accurate  (segmentation  result)  thresholds.  1 
can  appear  in  the  vicinity  of  average  gray  values  associated  with 
classes  counting  relatively  greatly  in  proportions  [5].  As  is 
shown  in  Fig.l,  in  case  (a),  areas  accounted  for  by  object  and 
background  ©1  and  ©2  are  comparable.  In  case  (b),  ©1  «  ©2  . 
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Moreover,  in  case  (c),  u2  «  col  .  Obviously,  case  (a)  is  the 
condition  that  any  segmentation  algorithm  hopes  to  satisfy. 
However,  in  case  (b),  algorithms  will  take  the  relatively 
numerous  background  image  elements  and  determine  them  to  be 
targets.  In  case  (c),  algorithms  can  take  a  portion  of  target 
image  elements  and  read  them  as  b'ckground.  This  is  because, 


Fig. 1  A  Few  Typical  Object  Background  Conditions 


normally,  characteristic  values  associated  with  objects  such  as 
gray  values— are  not  absolutely  uniform.  The  object  in  case  (c) 
occupies  almost  the  entire  segmentation  area.  Therefore,  certain 
nonuniform  parts  of  objects— speaking  in  terms  of  algorithms— 
seem  like  they  have  changed  into  "object"  and  the  remaining  part 
seems  to  have  changed  into  "background".  Case  (b)  can  be 
analyzed  in  a  similar  way.  Therefore,  Kittler  and  others 
discovered  that  one  of  the  causes  of  this  problem  ought  to  be 
that,  when  segmentation  criteria  are  made  use  of  and  areas  have 
not  satisfied  proper  conditions  for  object  and  background, 
segmentation  algorithms  with  stable  performance  should  include  a 
step  associated  with  the  automatic  discrimination  and  adjustment  of 
segmentation  areas.  Attention  is  paid  to  human  vision— in  the 
process  of  apprehending  pictures— being  able  to  flexibly  alter 
fixation  points  and  the  size  of  fixation  areas.  Fixation  points 
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are  primarily  distributed  in  locations  where  information  is 
relatively  abundant  and  characteristics  are  relatively  striking 
[6].  Taking  this  principle  and  introducing  it  into  object 
extraction  processes,  it  is  possible  to  obtain  a  simple 
computational  model  associated  with  changing  fixation  points  and 
areas  in  order  to  achieve  optimum  segmentation  areas .  Simple 
rational  criteria  for  estimating  whether  information  in  a  certain 
area  is  abundant  or  not  can  be  intuitively  deduced  from  Fig.l. 

In  case  (a),  segmentation  area  characteristics  such  as  gray 
variance  contrasts  are  relatively  large.  However,  on  the  basis 
of  uniformity  within  classes,  the  variance  contrasts  associated 
with  areas  in  cases  (b)  and  (c)  are  relatively  small.  Letting 
the  size  of  segmentation  area  Q(xl,yl)  be  LxW,  (xl,yl)  is  the 
center  coordinate  in  the  area  in  question.  The  mean  value  of 
image  element  f(x,y)  in  Q  is  mo.  Variance  o 2  is 

tn  w/s 

^  =  2  '2  </<*»+* O', +3>) -m0)*  (3) 

j  ui  wn 

2  2  /(■*»+* (4) 

Letting  L,  W,  and  (x,y)  be  variables,  the  size  and  location  of 
areas  awaiting  segmentation  Q  can  change.  o2  is  a  function  of  L, 
W,  and  (xl,yl).  Because  of  this,  criteria  to  find  optimum 
segmentation  areas  can  be  written  as 


oLAL-  ,W  Ur  ,y{  )  =  max  {cr(L.  IV)  ! jrt .  v, }  (5) 

Lo- AL^L^Lo^AL 

xo  —  xi  <xo  +  Az 

yO  —  Ay^  y  l  <  yo  ■+>  A y 
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In  the  equations,  Lo,  Wo,  xo,  and  yo  are  original  values  of 
variables.  AL,  AW,  Ax,  and  Ay  are  search  radii.  L*,  W*,  (x*  ,y’ ) 

are  the  length,  width,  and  position  coordinates  segmentation  area 
A*  . 

2.  Visual  Nonlinearity  and  Constancy 

Segmentation  is  the  initial  perception  process  associated 
with  objects  and  backgrounds.  Let  Am  =  Am2  -  mlA.  Then, 
equation  ( 1 )  can  be  written  as 

flJ(/)=tu1(f)w2(r)A»2*0)  (6) 


Without  losing  generality,  Am  can  be  seen  as  the  absolute 
difference  in  brightness  between  object  Cl  possessing  average 
brightness  ml  and  background  C2  possessing  average  brightness  m2 
Experiments  in  the  biology  and  psychology  of  vision  point  out 


Fig. 2  Relationship  Between  AmT  and  m 
Key:  (1)  Zone 
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that,  in  backgrounds  of  different  brightnesses,  equal  absolute 
differences  in  brightness  are  certainly  not  able  to  give  rise  to 
equal  visual  responses.  Vision  capabilities  of  distinguishing 
against  Am  follow  along  with  increases  in  background  brightness  m 
and  decrease.  If  one  uses  the  brightness  difference  AmT  where 
perception  is  just  possible  to  characterize  this  type  of 
perception  capability,  then  AmT  follows  increases  in  m  and 
increases  [7].  As  is  shown  in  Fig. 2,  the  relationship  between 
AmT  and  m  is  nonlinear .  It  can  be  divided  into  the  dark 
adaptation  zone,  the  Devries-Rose  zone,  and  the  Weber  zone,  being 
approximately  represented  as 


Amr— 


,1/2 


<r,wz"-,  Devries-Rose  K^cmA- 
a.m,  Weber 


(7) 


In  the  equations,  al  and  a2  are  appropriate  constants.  Obviously, 
criterion  functions  associated  with  equation  (6)  are  dependent  on 
changes  in  absolute  degrees  of  gray.  Therefore,  they  do  not 
match  up  with  visual  nonlinearities.  Besides  this,  equations  (3) 
£2  (5)  are  average  measurements  of  nonuniformities  within  A. 
Nonlinearity  is  also  a  type  of  brightness  change.  If  one  makes 
use  of  equations  (3)  -  (5)  as  criteria,  then  perceptions  of 
nonlinearity  are  only  related  to  average  absolute  brightness 
changes  within  the  area.  Therefore,  there  exist  problems  of  the 
same  kind.  If  one  takes  the  concept  of  background  brightness  and 
generalizes  it  to  include  average  brightness  associated  with  an  ^ 
area,  then  equation  (3)  can  be  revised  in  order  to  reflect  visual 
nonlinearity  principles 


Lfl 


LXW^ 


,AiJ  *•<-> 


(8) 
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In  equations,  So (mo)  is  the  increasing  function  associated  with 
mo.  For  example,  it  is  possible  to  select  the  form 


[62m0+pt  rn^mT 


(9) 


mT  is  the  predetermined  brightness  threshold. 
(2)  are  revised  to  be 


of  (/)  =  CU1(f)£U2(f)  ( 


m;(f) 


}* 


Equations  ( 1 )  and 


(10) 


aZ^Cti'  )=max{cu, 


(ID 


In  the  equations,  SI  (mj,mo)  is  the  increasing  function 
associated  with  mj  .  As  far  as  ('j=l , 2 )  are  concerned,  it  is 
possible  to  select  the  forms  below  /49 

‘  O-i  Ao-i+O+i,  <12a> 

U:(w?,+c)+62 

(12b) 


Above,  the  forms  of  So  (.)  and  SI  set  up  a  conceptual 
relationship  with  Devries-Rose  zones  and  Weber  zones,  thereby- 
considering,  in  calculations,  using  the  parameters  01,  Pi,  02,  P2 
and  al,  bl,  a2,  b2  to  control  the  strength  of  the  influences  of  mo 
and  mj  on  segmentation  process  nonlinearities.  The  parameter  c 
can  be  used  to  control  the  influences  in  changes  of  environmental 
illumination  on  segmentation  processes.  Equations  (8)  -  (12) 
reflect  the  concepts  of  perceptual  relative  brightness  changes. 


During  actual  calculations,  in  the  end,  option  is  made  for  mj  = 
ml  or  mj  =  m2.  As  far  as  what  sort  of  influence  the  parameters 
a j ,  b j ,  and  c  have  on  optimized  threshold  t*  is  concerned,  it  is 
necessary  to  carry  out  analysis  and  experimental  research. 

Comparing  the  differences  between  optimum  thresholds 
obtained  from  equation  (2)  and  equation  (11) —  and 

- — it  is  possible  to  prove  that  ml  and  m2  are  both  monotonic 
increasing  functions  of  the  threshold  variable  t  .  Therefore, 
Sl(mj,mo)  is  also  a  monotonic  increasing  function  of  t  .  Giving 
it  a  little  thought,  it  is  possible  to  know  that  the  introduction 
of  ml  and  m2  has  a  tendency  to  make  *»’  lower  than  /0* 

This  is  advantageous  to  the  separation  of  relatively  small 
objects  from  backgrounds  in  low  contrast  imagery.  It  is  possible 
to  prove  that,  when  t  changes,  the  range  of  changes  in  ml  is 
[ 0 , mo ] .  The  range  of  changes  in  m2  is  [mo, 255].  Letting  S(mj,mo) 
=  5f  (mj ,mo) ,  consider  the  characteristics  of  weighting 
function  l/S(mj,mo).  For  example,  make  al  =  1,  c  =  0,  bl  =  1 . 
Then,  S (ml )  =  ml  +  1.  S(m2)  =  m2  '+  1.  What  Fig.  3  shows  is  the 

1/S(mj )  curve.  As  far  as  1/S(ml)  is  concerned,  values  are 
selected  in  the  zone  [0,mo].  For  1/S (m2),  values  are  selected  in 
the  zone  [mo, 255].  Obviously,  the  rate  of  change  associated  with 
the  former  function  as  well  as  the  range  ojf  changes  are  both 
greater  than  the  latter.  Therefore,  the  influence  of  ml  on 
equation  (10)  ‘  s  •<&  and  thus  the  influence  on  optimized 


Fig. 3  Weighting  Function  1/S(mj) 
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threshold  ti  is  greater  than  the  influence  of  m2.  Normally, 
it  is  possible  to  make  *>’  drop  even  lower.  The  effects  of 
parameter  bl  is  equivalent  to  shifting  the  zone  of  selected 
values  associated  with  1/S(mj)  along  the  horizontal  axis.  When 
bl  increases,  the  zone  of  selected  values  shifts  toward  a 
location  where  the  curve  is  relatively  flat.  The  influences  of 
the  variable  mj  on  1/S(mj)  gradually  decrease.  At  this  time,  the 
characteristics  associated  with  equation  (11)  gradually  approach 
conventional  Otsu  criteria.  This  article  designates  bl  as  an 
inertial  factor.  The  effects  of  al  lie  in  changing  the  form  of 
1/S(mj),  thereby  altering  the  rate  of  change  and  the  scope  of 
change,  producing  effects  on  of  •  When  SI  takes  the  form  of 
(12-b),  due  to  the  fact  that,  with  the  same  parameters,  the  rate 
of  change  and  the  scope  of  change  associated  with  1/S(mj)  are 
generally  both  greater  than  SI  taking  the  form  of  equation  (12- 
a),  therefore,  the  influences  of  the  former  on  are  also 

larger  than  the  latter.  Analysis  .and  experiments  verify  that,  m 
order  to  effectively  take  objects  and  separate  them  from 
backgrounds,  making  use  of  mj  =  ml  is  often  better  than  m2. 

The  brightness  constancy  associated  with  human  vision 
systems  can  be  represented,  within  a  certain  range,  using  what  is 
shown  in  Fig. 4  of  shifts  in  light  sensor  optical  strength 
response  curves  [8].  Let  I  stand  for  input  light  strength.  R  is 
sensor  response.  Rmax  is  maximum  saturation  response.  K  is  the 
input  optical  strength  corresponding  to  0.5  Rmax.  Then, 

|  R=R^(I/(J-hK))  (13) 

in  the  Fig.,  response  curves  shift  from  K1  to  K2 .  The  influences 
of  ambient  optical  illumination  changes  on  whole  images  or  local 
segmentation  areas  can  use  mean  brightness  increment  Amo  for 
representation 


(14) 


m  j=mjJr&m<> 


As  a  result,  optical  illumination  changes  are  equivalent  to 
shifting  the  1/S (mj ,mo)  value  selection  zone,  creating 
instability  in  t*  values  selected.  Introducing  parameter  C  to 
compensate  for  this  type  of  change  causes  optimum  thresholds  not 
to  be  influenced  by  environmental  changes  or  to  be  influenced 
very  little.  Environmental  illumination  changes  are  AI .  If  A I 
on  whole  images  or  local  areas  is  approximately  uniform,  and  the 
influences  of  these  changes  still  lie  within  sensor  dynamic 
ranges,  then  imagery  statistical  characteristic  changes  are 
reflected  as  left  right  shifts  along  the  grayness  axis  in 
grayness  histograms.  After  reference  points  have  been  determined 
beforehand,  from  these  shifts,  it  is  possible  to  calculate  /50 
variations  of  in  equation  (14).  Under  different  conditions, 

in  order  to  make  imagery  obtained  for  the  same  object  capable — on 
a  unified  basis — of  making  use  of  'standard  methods  to  acquire 
segmentation  results  associated  with  stable  performance,  it  is 
possible  to  opt  for  the  use  of  standard  transformations  of 
histogram  starting  points  to  partially  eliminate  the  effects  of 
AI .  For  example,  a  transformation  making  the  effective  histogram 
origin  to  be  the  zero  gray  level  operates  as  follows.  Under 
certain  optical  illumination  conditions,  the  histogram  for 


Fig. 4  Optical  Strength  Response  and  Brightness  Constancy 
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imagery  obtained  is  p(i),  i=0 , 1 , 2 , . . 255 .  Probabilities  from 
gray  level  0  to  gray  level  k  are  Pk  =  Setting  up  an 

adequately  small  probability  threshold  Ak  ,  if  Pk  <  Ak  ,  then,  m'j 
=  mj  -  k.  m'o  =  mo  -  k.  After  histogram  transformations  p'(0)  = 
Pk  ,  p' (i-k)  =  p(i) ,  i  >  k.  If  Pk  =  0,  then,  the  operations 
discussed  above  are  transformations  without  distortion  for  Al  > 

0.  Otherwise,  there  is  distortion.  At  this  time,  c  =  **k. 

On  the  basis  of  the  analysis  discussed  above,  two  of  the 
causes  for  the  problems  discovered  by  Kittler  and  others  ought  to 
be  Otsu  segmentation  criteria  not  possessing  visual  nonlinearity 
and  constancy  mechanisms.  The  methods  in  this  article  resolve 
this  problem  relatively  well.  They  are  capable  of  taking  mean 
values  associated  with  optimum  thresholds  from  the  vicinities  of 
classes  with  relatively  large  surface  areas  and  pulling  them  out 
with  controllability . 

3.  Two  Step  Process  Associated  with  Object  Imagery  Segmentation 

Adaptive  object  imagery  segmentation  can  be  summarized  as  a 
process  of  two  organically  related  steps. 

i)  On  the  basis  of  the  criterion  functions  specified  in 

association  with  equations  (5),  (8),  and  (9),  optimum  segmentation 

areas  Q  are  obtained. 

ii)  On  the  basis  of  criteria  specified  in  association  with 
equations  (10)-(12),  binary  segmentation  is  implemented  on  A*  to 
solve  for  optimum  thresholds 


With  regard  to  object  tracking,  initial  windows  are  normally 
manually  determined  when  tracking  starts,  or  automatically 
determined  by  correlation  matching  times.  Therefore,  (xl,yl)  is 
given.  Equation  (5)  then  simplifies  to  be 


I 


<£»(L*  ,W'  !*,,v.)  = 


max 

^  STS' aw 


[<r(.L,W  |xi  .>))} 


(15) 
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With  regard  to  the  automatic  extraction  of  unknown  small 
objects  from  large  fields — because  there  is  interest  in  the  upper 
and  lower  limits  on  the  sizes  of  object  images--it  is  normally 
possible,  on  the  basis  of  other  information  (for  example, 
distances  relative  to  the  camera)  to  make  rough  determinations 
before  the  fact.  Therefore,  in  equation  (5),  initial  values 
associated  with  the  size  of  segmentation  areas  and  variation 
ranges  can  be  set  or  dynamically  altered  in  order  to  guarantee 
the  relative  stability  of  criterion  characteristics  concerned. 
Whole  images  are  taken  and  divided  up  to  be  certain  initial  sub 
zones  (It  is  permissible  for  parts  to  overlap.).  Then,  the 
center  of  the  object  must  fall  in  a  certain  sub  zone.  The  center 
of  the  sub  zone  in  question  can  then  act  as  the  initial  values  for 
equation  (5)  coordinate  variables.  Coordinate  variation  range 
can  be  set.  Because  of  this,  it  is  possible  to  distinguish 
between  various  sub  zones,  using  equation  (5)  to  obtain 
corresponding  optimum  segmentation  areas .  Experiments  verify 
that  these  methods  can  normally  produce  reasonable  segmentation 
areas.  Equation  (5)  is  not  the  only  criterion  which  can  be  used. 
Other  criteria  (including  those  possessing  intelligent  decision 
functions)  are  worth  further  study. 

Equation  (5)  is  simple  and  capable  of  recurrence 
calculations.  The  amounts  of  calculations  added  to  search 
reasonable  segmentation  zones  are  not  great.  In  particular,  as 
far  as  the  extraction  of  unknown  objects  in  large  fields  is 
concerned,  the  amounts  of  calculations  are  primarily  related  to 
the  number  of  sub  zones.  For  example,  with  regard  to  512x512 
imagery,  use  is  made  of  32x32  initial  sub  zones  on  VAX11-730 
computers.  The  time  to  carry  out  target  extraction  is 
approximately  6  seconds.  Making  further  use  of  inherent  parallel 
characteristics  in  the  method,  it  will  possess  real  time 
application  potential. 

4.  Experimental  Results 

Experiments  were  carried  out  on  microcomputer  image 
processing  systems.  Fig. 5  is  segmentation  results  associated 
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with  making  use  of  Otsu  criteria  on  a  few  images.  In  it,  (a)  and 
(b)  are  original  object  images  and  their  histograms.  (c)  and  (d) 
are  segmentation  results  as  well  as  curves  for  variations  of  ai 
as  a  function  of  threshold  t  .  (e),  (f),  (g),  and  (h)  are  actual 

results  for  a  different  image.  It  can  be  seen  that,  due  to  the 
fact  that  whole  images  do  not  satisfy  appropriate  object 
background  conditions,  objects  of  interest  are  not  able  to  be 
separated  from  backgrounds.  (i)  is  sub  zones.  Although  they 
basically  satisfy  appropriate  object  background  conditions, 
except  for  one  image,  in  the  other  two  images,  there  is  still  no 
capability  to  obtain  accurate  results.  Fig. 6  (a)  and  (b)  are 
test  results  associated  with  the  methods  of  this  article.  They 
are  clearly  superior  to  conventional  methods.  In  experiments,  we 
set  the  initial  window  size  as  50x50.  The  optimum  window  sizes 
obtained  for  searches  of  two  images  were,  respectively,  37x37  /51 
and  70x70.  When  control  parameter  bl  values  increase,  of  (t) 
curve  variations  are  seen  in  (c)  and  (d)  respectively. 

Obviously,  optimum  thresholds  t{  '  follow  increases  in  bl  and 
increases.  In  conjunction  with  this,  there  is  an  approximation 
of  conventional  Otsu  criteria  thresholds. 

III.  SEQUENCED  IMAGE  OBJECT  EXTRACTION  AND  TRACKING 

1 .  Fast  Threshold  Estimates 

In  order  to  increase  calculation  speeds  associated  with 
sequenced  image  object  extraction,  it  is  necessary  to  adequately 
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Fig.  6  Experimental  Results  for  the  Methods  of  This  Article 
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make  use  of  interframe  correlativities .  However,  object 
extraction  and  tracking  in  actual  environments  still  /52 
require  considering  differences  between  frames  created  by  various 
types  of  interference  and  changes.  Letting  the  statistical 
histogram  associated  with  the  segmented  window  of  the  previous 
frame  be  pn-1  (i) ,  the  statistical  histogram  of  the  current  frame 
area  awaiting  segmentation  is  pn(i),  i=0,l,2, . . . ,255.  Then,  the 
relationship  between  the  statistical  characteristics  of  the  two 
frames  can  be  expressed  as 


255 

2  A  />(/)  =  0 


(16) 

(17) 


In  the  equations,  Ap(i)  stands  for  changes  in  statistical 
characteristics  between  frames.  As  far  as  *'  ( t)  changes  given 

rise  to  by  Ap(i)  are  concerned,  they  lead  to  variations  m 
optimum  thresholds  associated  with  various  frames. 


„  mr1)+Am.-mrl)-Am1_y 

tf2(f)<*>  =  [a4*-,>+Aa)1][a4-1>+A^JL5i(m<.-i>+Amy,m(»-‘>+Am0) 

=  0?G)<o'n+  A^G) 

t;  <»)-*;  (n-D+A^r  (19) 

In  the  equations,  the  superscripts  n  and  n-1  stand  for  the 
present  frame  and  the  previous  frame.  Making  use  of  interframe 
correlativity,  the  optimum  threshold  t*(n-l>  associated  with  the 
(n-l)th  frame  can  act  as  the  predicted  value  for  optimum 
threshold  estimates  associated  with  the  nth  frame  window. 
Therefore,  optimizing  processes  do  not  need  to  be  carried  out  m 

the  entire 
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Fig. 7  Object  Extraction  Model  Schematic 

Key:  (1)  Search  Area  and  Qn  "differentiation  (3)  Optimum 

(2\'  Object/Backgroundy  Coj?d^  .  frame  Difference  Analysis  (5) 
'  .  j  w  /  a  \  Histogram  and  In  .  Solving  for  Optimum 

Window  Setting  (')  ^  -.2-  /q\  Delay 

r^afhoir^/se^aticJn  (6,  Segmentation  Resuits 

However,  at  this  time,  .the  determination  of  the 
gray  range.  Howe  interframe  differences.  The 

search  range  must  cons  associated  with  the 

-  m-iterion  function  e  assocxa 

definition  of  the  enteric 

measurement  of  differences  is 


e=f)|A/>(Ol 


(20) 
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point,  in  Fig. 7,  this  article  presents  an  adaptive  sequenced 
image  object  extraction  model  possessing  controllability. 

2 .  Binary  Template  Matching 

One  type  of  often  used  algorithm  associated  with  object 
tracking  is  template  matching.  Here,  templates  are  objects  in 
previous  frame  tracking  windows,  and  matching  searches  are  then 


Fig. 8  Sequenced  Image  Object  Tracking  Results 

carried  out  within  an  area  of  specially  designated  size  inside/53 
the  current  frame,  acquiring  the  optimum  object  center  positron 
within  the  area  in  question.  Gray  level  template  matching  is  a 
often  used  matching  method,  despite  the  fact  that  there  are 
various  types  of  fast  algorithms  [9-11].  However,  direct  gray 
level  template  matching  does  not  possess  stability  wit  regar 
environmental  optical  illumination  conditions  and  interference. 
On  the  foundation  of  object  extraction  which  this  article 
already  described  previously,  option  is  made  for  the  use  o 
binary  template  matching.  Search  areas  also  make  use  of 
previously  discussed  two  step  segmentation  processes  to  acquire 
binary  characteristic  imagery.  The  object  extraction  methods  of 
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this  article  possess  relatively  good  adaptability  with  regard  to 
environmental  and  background  changes,  providing  a  foundation  for 
the  reliability  of  subseguent  binary  matching  operations. 
Moreover,  binary  matching  calculations  are  realized  with  simple 
and  easy  to  use  specialized  hardware,  possessing  real  time 
application  value.  Experiments  are  carried  out  on  microcomputer 
imagery  processing  systems.  Table  1  is  partial  results  obtained 
with  optimal  threshold  sequences  during  object  extraction  and 
tracking  on  three  image  series.  In  them,  partial  tracking 
results  for  the  first  two  series  of  images  are  shown  in  Fig. 8. 
From  right  to  left  and  from  top  to  bottom,  the  objects  in 
sequenced  images  move  from  right  to  left.  Tracking  windows  are 
formed  automatically.  Tracking  results  are  very  good. 

Table  1  Optimal  Threshold  Sequences 


1 

2 

3 

4 

5 

6 

7 

'8 

9 

10 

11 

12 

13 

14 

15 

16 

I 

8 

9 

7 

7 

7 

5 

6 

6 

7 

7 

7 

7 

6 

6 

7 

6 

I 

70 

72 

71 

70 

71 

71 

70 

70 

70 

56 

56 

56 

58 

59 

58 

57 

I 

11 

10 

10 

10 

11 

11 

11 

10 

11 

11 

11 

10 

11 

11 

10 

11 

(1) 


Image  Sequence 


IV.  CONCLUSIONS 


This  article  analyzes  certain  problems  associated  with 
conventional  object  extraction  and  tracking  algorithms,  pointing 
out  that  the  causes  of  instability  in  object  extraction  algorithm 
performance  lie  in  not  possessing  differentiation  and  adjustment 
capabilities  with  regard  to  object  background  conditions. 
Moreover,  they  are  normally  based  on  absolute  gray  levels  or 
absolute  gray  level  changes.  On  the  basis  of  visual  perception 
principles,  we  put  forward  one  type  of  effective  sequenced  image 
object  extraction  model  and  algorithm,  analyzing  and  empirically 
demonstrating  the  effects  of  control  parameters  in  models  on 
segmentation  threshold  selection  and  segmentation  performance. 
Making  use  of  sequenced  image  interframe  correlativity  and 
differences,  it  is  possible  to  automatically  set  up  initial 
values  and  search  ranges  associated  with  threshold  estimates.  On 
this  foundation,  binary  template  matching  and  tracking  are 
carried  out.  In  experiments  on  sequenced  visible  light  images 
possessing  complex  backgrounds,  verification  is  done  of  the 
accuracy  and  broad  application  value  of  this  article's  methods. 
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